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To characterize the pairing-specificity of RNA secondary structures as a fiinction of temperature, 
we analyse the statistics of the pairing weights as foUows : for each base (i) of the sequence of length 
N, we consider the (A'' — 1) pairing weights Wi{j) with the other bases {j =^ i) of the sequence. We nu- 
merically compute the probability distributions Pi(w) of the maximal weight wY^"'^ = maxj[wi{j)], 
the probability distribution 11(12) of the parameter ¥2(1) = w1(j), as well as the average values 
of the moments Yfc(j) = . Wi{j). We find that there are two important temperatures Tc < Tgap- 
For T > Tgap, the distribution Pi(w) vanishes at some value wo{T) < 1, and accordingly the mo- 
ments Yfc(i) decay exponentially as {wo{T))'' in k. For T < Tgap, the distributions Pi{w) and n(l2) 
present the characteristic Derrida-Flyvbjerg singularities at i« = 1/n and Y2 = 1/n for n = 1,2... 
In particular, there exists a temperature-dependent exponent //(T) that governs the singularities 
Pi{w) ~ (1 — to)'''"^'"^ and 11(12) ~ (1 — 1^2)^'""^'"^ as well as the power-law decay of the moments 
Yk{i) ~ l/k'^^'^K The exponent /i(r) grows from the value ^(T = 0) = up to fJ.{Tgap) ~ 2. The 
study of spatial properties indicates that the critical temperature Tc where the large-scale roughness 
exponent changes from the low temperature value ~ 0.67 to the high temperature value ~ 0.5 
corresponds to the exponent fi{Tc) = 1. For T < Tc, there exists frozen pairs of all sizes, whereas 
for Tc < T < Tgap, there exists frozen pairs, but only up to some characteristic length diverging as 
(,{T) ~ l/(Tc — T)" with ~ 2. The similarities and differences with the weight statistics in Levy 
sums and in Derrida's Random Energy Model are discussed. 

PACS numbers; 87.14.Gg; 87.15.Cc; 64.70.Pf 

I. INTRODUCTION 

Models of RNA secondary structures [l|, [1] have been recently studied by physicists [1, 0, H, H, 0, H, H, [13, [HI , 
because of the similarity of the low-temperature disorder-dominated phase with the spin glass phase. This phase 
has been analysed either in terms of replica theory of spin-glasses |12l| , with non-trivial overlap distribution function 
P{q) d, 0, 111 J or in terms of the droplet theory of spin-glasses [l3l| : in this case, a finite droplet exponent 9 > 
has been obtained via the e-coupling method, with finite values 9 ~ 0.23 d, , ^ = 1/3 [1] whereas a vanishing 
droplet exponent 9 = and logarithmic droplets have been found via the statistics of pinch free energiesfGllTlj. Many 
authors have also studied the phase transition towards the high-temperature molten phase, with different values for 
the correlation length exponent v and the specific heat exponent a ^ 2 — v. Numerical results have given for instance 
^~3.9 orz/~ 1.1 [9|, and the field theory of [To"! predicts v ~ 8/5 ~ 1.6, whereas the general theorem of fT3| 
on phase transitions in disordered systems states that the finite-size correlation exponent v has to satisfy the bound 
v>2/d= 2. 

In this paper, we try to clarify the nature of the low temperature phase and of the freezing transition by studying 
statistical properties of the pairing weights. 

The paper is organized as follows. The model and usual observables are recalled in Section |TT1 We then present a 
. ^ ' detailed study of the pairing weights seen by a given monomer. For clarity, the statistical properties of the weights 
^ , alone, independently of the distances involved are described in Section Hill whereas the study of spatial properties is 
5^ ' given in Section HVl We summarize our results in Section|Vl For comparison, we recall in the Appendix the properties 
5^ ^ of the weights statistics in Levy sums and in Derrida's Random Energy Model (REM), as well as the corresponding 
Derrida-Flyvberg singularities. 



II. MODEL AND OBSERVABLES 



A. Partition function 



An RNA secondary structure of a sequence of N bases (1,2,..., N) is a set of base pairs all compatible with each 
other. To be compatible, two pairs and {k,l) have to be non-overlapping (for instance i < j < k < I) or nested 
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(for instance i < k < I < j) [l|] The energy of an allowed configuration C is then the sum of the energies e^.j of all the 
pairs j) that are present in the configuration 

EiC)^ J2 (1) 

This non-crossingproperty of pairs allows to write the following recursion for the partition functions Zi j of intervals 

k—i 

The first terms represent the configurations where j is unpaired, whereas the second term represents the configurations 
where j is paired with the base k € + — Pmin } , and Pmin represents the minimal distance along the sequence 
to form a pair. So the full partition function Zi^n can be computed in a CPU time of order 0{N^). In the literature 
0) 0) B B 0) B B [iS EH , various choices for the parameter Pmin and for the distribution of the energies eij have 
been made that we will not rediscuss here. All the numerical results we will present below corresponds to the case 
Pmin = 1 ( the convergence towards the asymptotic regime iV oo is then much more rapid than for the 'biological 
value' Pmin = 4), and to the bond-disorder case, where the e^.j are independent random variable drawn with the flat 
distribution 

/9(e) = 1 for - 2.5 < e < -1.5 (3) 

The sequence length iV and the corresponding number ns{N) of independent sequences that we have studied are 
typically as follows 

N = 50,100,200,400,600,800 
ns{N) = 18.10^,2.10^3.10^25.10^6.10^,2.103 (4) 



B. Pair probabilities 

The pairing probability of bases (i,j) in the sequence (1, N) reads 



ext 



where represents the partition function of the internal sequence (z + 1, j — 1) computed in Eq. [2l and 

where Zf^* represents the partition function of the external sequence (1, 2, .., i — 1, j + 1, ...N), which can be computed 
by extending the recursion of Eq. [2]to the duplicated sequence (1, 2, .., N, 1,2, ..N): Zfj* is then given by Zj^i^N+i-i 
0. So the pair probabilities Pij can also be computed in a CPU time of order 0{N'^). 



C. Height profile 

An RNA secondary structure C can be represented as a non-crossing arch diagram or cquivalcntly as a 'mountain 
profile' (see Fig. 3 of where the height hk represents the number of pairs such that i < k < j : this height 
starts at h{k = 1) = 0, ends at h(k — N) = 0, remains non-negative in between, and the difference (/ifc+i — hk) can 
only take the three values (+1, 0, —1). Its thermal average reads in terms of the pair probabilities of Eqs. ^ 

<hk>^ J2 P^^3 (6) 

i<k<j 



D. Overlap 

In disorder-dominated phases, such as spin-glasses or directed polymers for instance, the overlap is usually a 
convenient order parameter. Here, the overlap between two-configurations Ci and C2 can be defined as 



q{Ci,C2) - ;^X!^(*j)eCil(»j)eC2 

i<j 



(7) 
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where the normalization factor N /2 represents the number of pairs existing in the ground state. The thermal average 
reads 

in terms of the pair probabilities of Eq. ([5]). 

However, this overlap is not an appropriate order parameter for RNA secondary structure, because it does never 
vanish, even at T = oo as wc now explain. 



E. Limit of infinite temperature 

In the limit T = oo, disorder disappears, and the partition function can be exactly computed 



N 



This number of possible configurations corresponds to the number of ID-random walk for the height /i^, with 3 choices 
per step for the height increment h^+i — hk — 0, ±1, and where the factor l/N^^'^ is a first-return probability. From 
this interpretation of the height as a positive random walk, it is clear that the middle height scales as N^/'^ 

< Hn > {T = oo) ^ N^/^ (10) 

For 1 <C Z ^ iV, the pair probability of Eq. [S] behaves as 

iV3/2 
Z3/2(Ar-Z)3/2 

However, the overlap (Eq. [8]) is finite 

g2(T = oo)>0 (12) 
because small pairs have a finite weight, in particular for 1 = 1, Eq. [5] yields 

,(T=oo) ^l,Af-2(r = OO) ^ J_ 

Zi.n{T = oo) 3^ 



^M-hi(^-"«)« ,3/2/.r n3/2 (11) 



F. Limit of zero temperature 

At T = 0, there is a numerical consensus [1, 0, [ll[ that the disorder-averaged height has a different scaling from 
the random walk value of Eq. [TUl 

< hN >(T = 0) ~ 7V« (14) 

where the roughness exponent 

C--0.67 (15) 

is extremely close to the simple value 2/3, although we are not aware of any rigorous or heuristic argument in favor 
of this fraction. The exponent r\ governing the scaling of large pairs 1 ^ / ^ 

(16) 

is actually directly related to the roughness exponent via the relation 0, 0, [13, [HI 

77 = 2-C (17) 

as can be seen from the definition of the height (Eq. [6|). This relation valid at any temperature corresponds to 
•r]{T = oo) = 3/2 in agreement with Eq. 1111 and to 

r^{T = 0) - 1.33 (18) 

as directly measured in [ll| . 
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G. Characterization of the transition in previous works 



Previous work has shown that there exists a finite temperature transition between a high temperature or mohen 
phase, where entropy dominates, and a low-temperature phase where disorder dominates. But very different observ- 
ables have been used numerically to characterize the transition, such as the overlap probability distribution P{q) 
d, 0, [1] , the e-coupling method Q , and the so-called 'pinch- free energy' [1, |Tl| . In the field theory of [13] , the critical 
exponents exactly at criticality were found to be the same as the ones in the low temperature phase, both for the 
height 



1 
2 

C(r < T,) = C(T - 0) (19) 



and for the overlap -Pf of large pairs I ^ 1 

P^, ^ {p-^if ~ 1 for T > T, 
1 



^M+z - - J^^) for T < (20) 

In the following, we propose to study the freezing transition via the statistical properties of the pair weights seen by 
a given monomer. In [J], the integrated probability distribution of the maximal weight Pmax seen by a given monomer 
has been measured to characterize the barrier statistics between degenerate ground states for discrete disorder, but the 
phase transition region was not studied in details from this point of view. In another context concerning disordered 
polymers [l5i |. the distribution of the maximal weight Pmax was used to analyse a phase transition, the important 
region being there the neighborhood of Pmax ^ 0, whereas in the present study, the important region is the region 

Prnax ^ 1- 



III. STATISTICAL PROPERTIES OF THE PAIR WEIGHTS 



A. Pair landscape seen by a given monomer 




j 



FIG. 1: (Color online) Pairing weight landscape (— Inwi(j)) seen by the middle monomer i = ^ for A'^ = 200 at low and high 
temperatures (a) at low temperature {T = 0.02 ), only a few weights dominate ( note the scale of In U)i{j) ), at some random 
positions (b) at high temperature (T = 0.5), the disorder is a small perturbation with respect to the entropy of the pure case 
that favors the neighbors. 



For each base (z) of the sequence of length N, we consider the {N — 1) pairing weights with the other bases {j ^ i) 
of the sequence (Eq. ^ 
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Making the convention that Wi{i) denotes the weight of the configurations where (i) is unpaired 

^ext 

w^(i) = (22) 

these weights are normahzed to 

Y,wr{j) = ^-Mi) (23) 

The pairing weight landscape seen by a given monomer is shown for two temperatures on Fig. [1] in the low- 
temperature frozen phase, only a few weights dominate in continuity with the limit of zero temperature where only 
one weight is non-zero, whereas in the high temperature phase, many weights contribute, and disorder represents a 
small random correction around the entropic term of the T — oo limit (Eq. Ilip . 

In the rest of this section, we describe the statistical properties of the weights alone, independently of the distances 
involved. The study of spatial properties is postponed to Section |IV] for clarity. 



B. Characterization of the weights statistics 

In analogy with the weight statistics in Levy sums and in the Random Energy Model [IB, [13] (we refer the reader 
to the Appendix for a summary of the most important results for our present study), we have numerically computed 
the probability distributions Pi{w) of the maximal weight 

wr"" ^niax.MMj)} (24) 

as well as P2{w) of the second maximal weight. 

Another useful way to characterize the statistical properties of the weights [13] (see Appendix) is to consider 
the moments of arbitrary order k 

that represents the probability that the monomer (i) is paired to the same monomer in k different thermal configu- 
rations of the same disordered sample. We have measured the probability n(y2) of the parameter 

Y,i^)=J2w^iJ) (26) 



as well as the moments Yk{i) for 2 < k < 100. Finally, we have also computed the density of weights 



f{w)=J2Si^-Mj)) (27) 



giving rise to the moments 



Ykii)= [ dww'^fiw) (28) 
Jo 

The normalization condition for the density f{w) is 



Yiii) = / dwwfiw) = 1 (29) 
Jo 

The properties of all these quantities in the case of Levy sums of independent variables are recalled in the Appendix. 
In the following, we describe their properties for the RNA case and discuss the similarities and differences with Levy 
sums. The numerical results for the histograms Pi{w), P2{w), f{w), 11(1^2) have been obtained by collecting the 
weights seen by each monomer i — 1,2..N in the ns{N) disordered sequences generated (see Eq. |3]for the numerical 
values used for N and ns{N)). 



6 




FIG. 2: (Color online) Probability distribution Pi{w) of the largest weight seen by a given monomer (see Eq |24|) (a) at T = 0.05 
(low-temperature phase) for A'' = 50,100,200,400 : the characteristic Derrida-Flyvbjerg singularities at ui = 1 and w — 1/2 
are clearly visible, (b) at T = 0.4 (high-temperature phase) for A'^ — 50, 100, 200 : the distribution P\{w) does not reach w — 1 
anymore, but vanish at some maximal value ■wo{T) < 1. 
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FIG. 3: (Color online) Probability distribution P\(w) of the largest weight seen by a given monomer (see Eg I24|l (a) at 
T\ ~ 0.095 where fJ^{Ti) — 1 for A*' — 50, 100, 200,400 : Pi{w) does not diverge anymore as w ^ 1 but remains finite, (b) at 
T2 = 0.15 where ii{T2) ~ 2 for = 50, 100, 200, 400 : the distribution Pi{w) vanishes linearly as w ^ 1. 

C. Probability distribution Pi(w) of the largest weight seen by a given monomer 

The probability distribution Pi{w) of the largest weight seen by a given monomer is shown on Fig. [5] for low and 
high temperatures. 

At low temperature (Fig. [2] a) , this distribution presents a divergent singularity near w — > 1 



Pi{w) cx {1 - w)"'^^^-^ 

W—>-l 



(30) 



with a temperature dependent exponent fi{T), called fi in analogy with the case of Levy sum of index /i, and with the 
random energy model where /x(T) = T/Tg (see the Appendix). However here in RNA, the pairings free-energies are 
not independent variables and are not drawn with the same distribution (as a consequence of the distances involved) , 
so the full distribution Pi{w) cannot coincide with the Levy sums result. Nevertheless, we find that Pi{w) presents 
the characteristic Derrida-Flyvbjerg singularities at w — 1/n (see Appendix ). The stronger singularity occurs at 
w = 1 and defines the exponent /i(T) ([5D|) . but the second singularity at w = 1/2 is also clearly visible on Fig. [21 
This shows that for each base in the frozen phase, all the weight is concentrated on a few pairing partners j. 

At sufficiently high temperature (Fig. [21 b), the distribution Pi{w) does not reach w = 1 anymore, but vanish at 
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some maximal value wq(T) < 1 

PiH cc {wo{T)~wy (31) 

with some exponent a, that within our numerical precision, does not depend on temperature (see below). 

So the present results for Pi{w) reveal the importance of two temperatures Ti < Tgap defined as follows. The 
temperature Ti is defined by 

^i{T^) = 1 (32) 

in Eq. [30]: for T < Ti, the probability distribution Pi{w) is divergent as w ^ 1 (as on Fig. [2] a) , whereas for T > Ti, 
the probability distribution Pi{'w) vanishes at w = 1 (see Fig. [3]b ) . Exactly at Ti, Pi{w — 1) remains finite (see 
Fig. [3] a ) . The second temperature Tgap is defined as the last temperature where Pi (w) reaches w — 1 with some 
exponent ^{Tgap). For T > Tgap, a gap appears in (Eq. [?T|) 



Wo{Tgap) = 1 (33) 

woiTgap + e) < 1 (34) 

We find that Tgap is clearly above Ti ^ 0.095 since at T2 ^ 0.15, Pi{w) still reaches w = 1 with a finite slope 
corresponding to /x(T2) ~ 2 as shown on Fig. [3]b. 



D. Probability distribution P2(w) of the second largest weight seen by a given monomer 



We show on Fig. |4]the probability distribution P2{w) of the second largest weight. The main singularities of P2{w) 
are the divergence near w ^ Q and the singularity near w (1/2)^, which is complementary to the singularity of 
Pi{w) at the same point w — > (l/2)~ [l3| (see Eq. IA19I and explanations in Appendix). For T — 0.02 (Fig. [4]a), 
there exists an infinite slope for P2{w) and Pi{w) a,s w ^ (1/2)^. For T = Ti ~ 0.095 (Fig. IHb), there exists a cusp 
for P2{w) and Pi{w) as w — > (l/2)~. 





FIG. 4: (Color online) Probability distribution Pi(ui) and P2{w) of the two largest weights seen by a given monomer (see Eq 
[241) (a) for T = 0.02 (here N = 200) there exists an infinite slope for P2(w) and Pi{w) as w ^ (1/2)" (b) for Ti ~ 0.095 (here 
A*' — 400), there exists a cusp for P2{w) and Pi{'w) as ui — > (l/2)~ 



E. Probability distribution 11(1^2) of the parameter Y2 



The parameter Y2 defined in Eq. [26] can reach the value Y2 ^ 1 only if the maximal weight w"^"'^ also reaches 
,ma£c 2. As a consequence, the probability distribution n(F2) has the same singularity near F2 — ^ 1 as in Eq. (|5D|) 

U{Y2)^c^^il-Y2r^^^-' (35) 
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FIG. 5: (Color online) Probability distribution 11(12) of the parameter Y2 (see Eg I26|l (a) at T = 0.05 (low-temperature phase) 
for A'^ = 50, 100,200,400 : the characteristic Derrida-Flyvbjerg singularities at Y2 — 1, Y2 = 1/2 and at Y2 — 1/3 are clearly 
visible, (b) at T = 0.4 (high-temperature phase) for TV = 50, 100,200 : the distribution 11(12) does not reach 1^2 = 1 anymore, 
but presents a gap. 




FIG. 6: (Color onhne) Probability distribution 11(12) of the parameter I2 (see Eq[26j (a) at Ti ~ 0.095 where fi{Ti) = 1 for 
N = 50,100,200,400 : 11(1^2) does not diverge anymore as F2 1 but remains finite (b) at T2 = 0.15 where fi{T2) ~ 2 for 
N = 50, 100, 200,400 : the distribution 11(12) vanishes linearly as 1^2 1. 



for < r < Tgap, (see Figl5]a), whereas a gap appear for T > Tgap, as shown on Fig. [5]b. 

For T < Tgap, the distribution n(y2) presents the characteristic Derrida-Flyvbjerg singularities at Y2 = 1/n (see 
Appendix ) : on Fig. [5] a, beyond the main singularity aX Y2 = 1, the secondary singularities at Y2 — 1/2 and at 
Y2 = 1/3 are clearly visible. 

The distribution n(y2) is shown on Fig. [5] for the two temperatures Ti and T2 corresponding to fJ-{Ti) = 1 and 
/i(T2) ~ 2, and can be compared with the distribution Pi{w) on Fig. [31 



F. Density f{w) 



The density /(w) introduced in Eq. (|27p is shown on Fig. [7] at low and high temperature respectively. By 
construction, this density coincides with the maximal weight distribution Pi(w) for w > 1/2, with the sum {Pi{w) + 
P2iw)) of the two largest weight distributions for 1/3 < w < 1/2, and so on [l3| (see Eq. IA19p . As a consequence, 
/(w) has the same singularity near w — *■ 1 as Pi(w) (Eq. [30]), and the same gap (Eq. [31]) as long as wo{T) > 1/2. The 
only other singularity is near w — s- where f{w) diverges in a non-integrable manner, because in the N — > 00, there 
is an infinite number of vanishing weights (only the product {wf{w)) has to be integrable at w = as a consequence 
of the normalization condition of Eq. [29]) . 
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FIG. 8: (Color online) Density f{w) of weights seen by a given monomer (see Eg I27|l as compared to fLevy{w) (Equation IA7|) 
for the corresponding value of ^ (thick curve) (a) at T = 0.02 where fi{0.02) ~ 0.13 { ioT N = 100, 200, 400 ) : f{w) is rather 
close to the density fLevyiw). (b) at T = 0.08 where ^i(0.08) ~ 0.77 (for iV = 50,100,200) : there is now a big difference 
between the density /(w) measured for RNA and the density fhevyiw). 



On Fig. [21 we compare the density /(w) measured in RNA with the density fLevy{w) (Equation IA7|) describing 
the weight statistics in Levy sums of independent variables. For small /i <C 1, the two density are rather close (Fig. 
|S]a). For larger /i, they are very different, because the density fhevyiw) disappears at the critical value — 1 (see 
the denominator of Equation I A7|) . whereas for RNA the density f{w) exists beyond ji = 1. 



G. Moments Yk{i) 



The moments Yk{i) (Eq. [25]) for various temperature are shown on Fig. [9] as functions of fc < 100. The decay for 
large k directly reflects the behavior of the distribution of the density /(w) near its maximal value, as can be seen on 
Eq. [221 For < T < Tgap, where Pi{w) and f{'w) behaves near w ^ 1 as in Eq. [301 the decay in k follow a power-law 
of exponent /J.(T) 



10 




For T > Tgap, where there exists a gap wo{T) for Pi{w), Eq. [31] also apphes to f{w) as long as wo{T) > 1/2 (since 
f{w) = Pi{'w) for w > 1/2 as mentioned above). This implies an exponential decay 

(MT)) 



Numerically, the measure of the decay of Yk(i) is the most convenient way to localize the temperature Tgap where the 
gap appears, and to measure the exponents. The temperature Tgap where the gap appears is found to be 

Tgap -T2^ 0.15 (38) 

The exponent a seems to be independent of T 

a - 0.5 (39) 

The exponent ^{T) grows with the temperature for < T < Tgap, as shown on Fig. [9]b : the temperature Ti where 
m(Ti) = 1 is 

Ti - 0.095 (40) 

In contrast with the REM (see Appendix) where ^i{T) — T/Tg is linear in the whole low temperature phase, Fig[S] 
b presents some curvature, which probably reflects the presence of some entropy. However, in the limit of very low 
temperature, the exponent /i(r) is linear in T and the coefficient depends on the droplet density as we now explain. 



H. Droplet analysis at order T in temperature 

At T = 0, there exists a unique ground-state where each monomer (i) is paired with its ground-state partner jgs{i)- 
In the droplet analysis of disordered systems [l^ [H, [l^ , various observables can be computed as first order in T in 
terms of the density p{E — 0, \)dE of two-level excitations of energy E ^ Q and size A. For instance the specific heat 
and the overlap are given in ID disordered spin chains by [l^, [23| 

C{T) J dXp{E^0,X) + OiT^) (41) 

1 - QEAiT) 2T J d\ \p{E - 0, A) + 0(T2) (42) 

i.e. the specific heat is related to the number of excitations whereas the overlap involves the number of spins belonging 
to excitations. We may apply this droplet analysis to Y]^{i) : the contribution at order T comes from the monomers i 
belonging to a droplet of energy E ^ : the pair with the ground-state partner has for weight 1/(1-1- e~^^), whereas 
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the pair with the droplet partner has the complementary weight e /(I + e ^^). Within this two-level description, 
one gets 



l-Yk{i) ^~^JdEjd\\p{E,\) 



1 - 



1 \ / e 



^c^J (^j d\\p{E = 0, A)^ h + 0{T^) 



where the integral Ik 

k-l 



Ji/2 Hi -P) ^1 



(43) 



(44) 



behaves logarithmically at large k 

Ik ~ Infc (45) 



k — *oo 



The comparison of this droplet analysis with EalMlindicates that the exponent /i(r) should increase from /i(T = 0) = 
linearly in T with a coefficient related to the droplet density 



p{T) dXXp{E = 0, A) + OiT^) (46) 



IV. STUDY OF SPATIAL PROPERTIES 

In the last Section, we have studied in details the statistics of the weights independently of the distance and 
identified two important temperatures Ti and Tgap. We now turn to the analysis various spatial properties to clarify 
the meaning of Ti and Tgap for the pair length statistics. 

A. Weight statistics for long-range pairs 

The density f{w) defined in Eq. [27]can be decomposed into /-dependent components as 

/H = E/'H (47) 

where fi{w) represents the density of weight of pairs of length I. At T = cx), these densities are concentrated on a 
single /-dependent value (see Eq. [TT]for N — > oo) 

[fi{w)]^^^^5{w-^) (48) 

whereas at zero temperature (see Eq. [TH]for TV — > oo), a weight is either (if the pair is not in the ground state) or 
1 (if the pair belongs to the ground state) 

[// H]t=o - biSiw) + ciSiw~l) (49) 
where the amplitude q of the existing weights (see Eq. [inifor N oo) decay with / as 

1 1 

« J^l^ = y03 (50) 

The densities fi{w) are plotted for various lengths / at low and high temperature respectively on Fig. [TOl At 
T = 0.05 (Fig. [TO] a), all curves present divergences at w ^ and at w ^ 1, in continuity with the two delta peaks 
present at T = (Eq. At T = 0.4 (Fig. [TO]b) all curves display an /-dependent gap, in similarity with the 

/-dependent delta peak of the T = oo limit (Eq. [48]) . 
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FIG. 11: (Color online) Densities fi{w) of weights for pair lengths I = 3,7,15,31,63 for size N = 200 (a) at Ti ~ 0.095, aU 
densities fi{w) still reach the value w — 1 (b) at Tgap = 0.15, the densities of small sizes I still reach the value w = 1, 

whereas the densities fi{w) of large sizes / don't. 



We show on Fig. [TT] the case of the two important temperatures Ti and Tgap. At Ti ~ 0.095 (Fig. [TT] a) , all 
densities fi{w) still reach the value w = 1. At Tgap ^ 0.15, the densities fiiw) of small sizes / still reach the value 
w = 1, whereas the densities fiiw) of large sizes / don't. 

These curves suggest the following picture : 

(i) for T < Ti, all densities fi{w) diverge near w ^ 1, so there exist frozen pairs of all sizes. 

(ii) for Ti < T < Tgap, there exist frozen pairs, but only of finite size. 

(iii) for T > Tgap, even short pairs are not frozen anymore. 

We now present various quantitative studies that confirm this scenario. 



B. Statistics of the distance Ipref to the preferred partner 



We now consider the probability distribution P'^'^'^ (Ipref) of the distance Ipref = \jpref{i) - 
and its preferred partner jpref{i), i.e. the monomer jpref 7^ * having the maximal weight (Eq. 
distribution coincides with the pair distribution of the ground state (Eq. [TOI 



i\ between a base i 
[2il). At T = 0, this 



T=0 



^P,,,+i{T = 0) 



(51) 
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whereas at T = oo, the maximal weight corresponds to the nearest neighbors with I = 1 for entropic reasons fEgs lllI 
andfTc 



T=CX3 



(52) 
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FIG. 12: (Color online) Ratio Bjv(r) defined in Eq. EH: (a) as a function of temperature T for sizes = 100 (O), ^ = 200 
(□), N = 400 (0), N = 600 (A), = 800 (l>). (b) finite size scaling of the same data in terms of the variable x = {T ~Tc)N^^" 
(see Eq. [56j with Tc = 0.095 and = 2 



So the first moment of this distribution 



(53) 



represents a correlation length that remains finite in the high temperature phase as N ^ oo. Since the second moment 
is also expected to be finite in the high temperature phase, it is convenient to define the ratio 



Bn{T) 



(-/pre/-) 



(54) 



which converge to in the high temperature phase, and to a non-zero value at criticality and in the low temperature 
phase. The results are shown on Fig. [T^] (a) : the critical temperature Tc coincide with Ti 



Tc = Ti - 0.095 
The finite-size scaling of these data according to 

Bn{T) ~b{{T-Tc)N^''' 

is consistent with the value 
as shown on Fig. [T^b 



We now discuss the behavior of iP'^'^f as a function of N for various temperatures. For T < it grows as 



and the probability distribution P^^^{1) follows the same scaling form as in the T — Q limit (Eq. [51]) 



(55) 



(56) 



(57) 



(58) 



(59) 
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FIG. 13: (Color online) Scaling form of the probability distribution P^'^^{1) : log-log plot of PfJ^'^^ {I) in terms of a; = l/N 
(a) for T — 0.05 (low-T phase) the rescaling is done with rj = 1.33 (see Eq. I59|l (a) for Tc ~ 0.095 the rescaling is done with 
rjc = 1.5 (see Eq. ETJ 




1.5 



> 










A 














& 

o 

□ 








(b) 


□ 












-9 


□ 





FIG. 14; (Color online) (a) iP'-^f (N) /N°-^ as a function of T for the sizes N = 100 (Q), N = 200 (□), iV = 400 (0), A'^ = 600 
(a), iV = 800 (>) (b) finite size scaling of the same data in terms of the variable x = {T-Tc)N'^^'" (see Eq. ^ with Tc = 0.095 
and 1/ — 2 



as shown on Fig. [13] a. At Tc ~ Ti ^ 0.095, the first moment grows as 



ipref ^ 

and the probability distribution Pff"^^ (l) follows the scaling form 



(60) 



1 / I 
* - 



(61) 



as shown on Fig. [T3]b. On Fig. [T3]a, the rescaled variable iP'^'^f /N^-^ is shown as a function of T for various sizes 
there is a crossing at Tc, and the data follow the finite-size scaling behavior 



Ipref 



(62) 



with v ^ 2 in agreement with the previous estimate of Eq. 1571 
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C. Pair distribution Pi^i+i and height scaling 

We have measured the median height for each sample 

hmed = ^ < hk > 



(63) 



Its average over samples is directly related to the first moment of the pair distribution Pi^i^i ( see Eq. ^ 



f'med 



(64) 



We find that in the whole low-temperature phase and at Tc, the roughness exponent is the same as at T = (Eq. [M]) 

7^~7V"-6^ for 0<T<T, (65) 

This is in agreement with Eq. [19] quoted from [l3|. Above Tc, the crossover towards the high-temperature roughness 
exponent C = 1/2 (Eq. fTU]) is well described by the following finite-size scaling form (see Fig. [15]) 



n ({T - Tc)N^^'') for T > T, 



^med 
7V0-67 

with v ^ 2 in agreement with the previous estimates of Eqs [53 and 1621 

0.35 




0.34 



H (X) 



0.32 
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(66) 









□ \ 






(b) \ 



















10 



FIG. 15: (Color online) Height scaling : (a) the curves h^^d/N°-^'^ of various sizes TV = 50 (*), iV = 100 (Q), N = 200 (□), 
N = 400 (0), N — 600 (a) present crossings shifting towards Tc (b) finite size scaling of the same data in terms of the variable 
X = (T - Tc)N^^'' (see Eq. [Ml with Tc = 0.095 and = 2 



Accordingly, we find that the pair distribution Pi^i+i follows the T = finite-size scaling of Eq. [16] in the whole 
low-temperature phase and also at Tc 



(Tc) 



-<I> 



7V1-33 V N 



(67) 



D. Overlap Pf^j^i of large pairs I 



We have also measured the overlap ^Yj+z '^^ large pairs. We find that the first moment scales as 



dl I Pf,+i 



N°-^'^ for 0<T <Tc 
iV°-5 for T = Tc 
cte for T > Tr 



(68) 
(69) 
(70) 
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The scaling exactly at is distinct from the low-temperature phase in disagreement with Eq. [20] quoted from [l^l , 
but coincides with the scaling found above for P'^'^f (Eq. [60]). It is thus convenient to define the ratio 



Rn{T) = 



JdllPl 



i+l 



J dl I Pi^i+i 



(71) 



which converge to in the high temperature phase, and to a non-zero value in the low temperature phase. Exactly 
at Tc, it is expected to decay as N°-^/N°-^'^ = N^°-^'^. On Fig. [Il](a), the curves N°-^'^Rn{T) present crossings that 
shift regularly towards Tc. The finite-size scaling of these data according to 



with Tc = 0.095 and = 2 is shown on Fig. [TBlb 



(72) 
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FIG. 16: (Color online) Critical behavior of the ratio Rn{T) defined in Eq. [TT]: (a) curves Af'' "i?jv(r) for the sizes = 50 
(*) iV = 100 (O), N = 200 (□), TV = 400 (0), N = 600 (A), = 800 (>) (b) finite size scaling of the same data in terms of 
the variable x = (T - Tc)N^''' (see Eq. [72J with Tc = 0.095 and = 2 



V. SUMMARY AND CONCLUSIONS 



In this paper, we have analyzed the freezing transition of random RNA secondary structures via the statistics of 
the pairing weights seen by a given monomer. In analogy with Levy sums and Derrida's Random Energy Model 
we have numerically computed the probability distributions Pi^w) of the maximal weight, the probability 
distribution n(l2) of the parameter Y2{i) = J2j '^iU)i '^'^ll as the average values of the moments Yfe(i) = J^j '^^{j)- 
We have found two important temperatures Tc < Tgap. For T > Tgap, the distribution Pi{w) and n(l2) have a gap 
and, accordingly, the moments Yk{i) decay exponentially in k. For T < Tgap., these moments decay with a power-law 
Yk{i) ^ l/kf^^'^\ and the distributions Pii^w) and 11(^2) present the characteristic Derrida-Flyvbjerg singularities at 
w = 1/n and Y2 = l/n for n = 1,2... The most important singularities occur at w = 1 with Pi{w) ~ (1 — 'w)f^^'^^~^ 
and 11(1^2) ~ (1 — F2)^^"^''~^- The exponent ^(T) increases with the temperature from the value fj,{T = 0) = up to 
fj,(Tgap) ~ 2. The study of spatial properties indicates that the critical temperature Tc where the large-scale roughness 
exponent changes from the low temperature value C ^ 0.67 to the high temperature value C ~ 0.5 corresponds to the 
exponent fJ.(Tc) = 1. The final picture is thus as follows : 

(i) for T < Tc, there exist frozen pairs of all sizes 

(ii) for Tc < T < Tgap, there exist frozen pairs, but they are of finite-size. 

(iii) for T > Tgap, even short pairs are not frozen anymore. 

Finally, the finite-size scaling of various data are consistent with a correlation length exponent ~ 2 that saturates 
the general bound v >2/d= 2 of [l3| for phase transitions in disordered systems. 

In conclusion, the numerical study of the weight statistics appears as an interesting tool to clarify the nature of low 
temperature phases existing in disordered systems. In particular, we have shown that the frozen phase is characterized 
by a temperature-dependent exponent fJ.{T) that governs the broadening of the delta peak existing at w = 1 at T = 0. 
We intend to study in a similar way other disordered models |21|] . 
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APPENDIX A: REMINDER ON LEVY SUMS, THE RANDOM ENERGY MODEL AND 

DERRIDA-FLYVBJERG SINGULARITIES 



1. Levy sums when the first moment is infinite 

In this section, we recall some results on the weight statistics [l^ for the case of Levy sums 

N 



i=l 



of N positive independent variables (xi, ..xn) distributed with a probability distribution that decays algebraically 

with < /i < 1, i.e. when the first moment diverges < x >= +oo. The sum Sn then grows as N^^'^, and the rescaled 
variable is distributed with a stable Levy distribution [22] . Another important property is that the maximal variable 
Xmax{N) among the N variables (xi, ...xjv) is also of order N^^f^, i.e. the sum Sm is actually dominated by the few 
biggest terms [la, US] • To quantify this effect, it is convenient to introduce the weights 

m = f- (A3) 

and their moments 

JV 



Yk = J2^' (A4) 



8=1 

In particular, their averaged values in the limit iV — > oo are finite for < /i < 1 and reads [lB| 

—Levy _ r(fc-/i) . , . 

- r(fc)r(i - ^,) 

The density /(w) giving rise to these moments 

T^^^''^ = r dww'^fiw) (A6) 
Jo 

reads 

= ro.)r(i-M) ^^'^ 

and represents the averaged number of terms of weight w. This density is non-integrable as w ^ 0, because in the 
limit TV — > c», the number of terms of vanishing weights diverges. The normalization corresponds to 



-Levy 



Yk=r' = / dwwf{w) = 1 (A8) 
Jo 

More generally, correlations functions between Yfe can also be computed fio'l, and the joint density of K weights 
reads [13] 
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2. Reminder on the Random Energy Model 



The Random Energy Model (REM) introduced in the context of spin glasses [23j is defined by the partition function 
of N spins 



(AlO) 



where the (2 ) energy levels are independent random variables drawn from the Gaussian distribution 

PNiE) = ^e-* 



(All) 



It turns out that the low temperature phase < T < Tg of the REM [1^, [2J| is in direct correspondence with Levy 
sums of index < u = < 1 : the weights 



-/3-Eq 



(A12) 



have exactly the same moments Yfc (Eq. lASp and the same density /(w) (Eq. IA7I1 . The explanation is that the lowest 
energy in the REM are distributed exponentially 



Pextremal i-E) — ^ 



-yE 



(A13) 



This exponential form that corresponds to the tail of the Gumbel distribution for extreme- value statistics [25|, |20| , 
immediately yields that the Boltzmann weight x = e~^^ has a distribution that decays algebraically (Eq. IA2p with 
exponent 



^1 = T-f 



(A14) 



In the REM, the coefficient 7 in the exponential (Eq. IA13P is 7 = l/Jg- 

The link with the thermodynamics is that the entropy Sn remains finite as N grows and is given in terms of the 
weights by [23| 



TV 



Sn{T < Tg) ^ - '^Wi \n{wi) 



N 



[dkYk]„^, = r'(i) 



r'(i-MT)) 
r(i-/i(T)) 



(A15) 



and the corresponding specific heat Cn{T <Tg) = TdrSisiiT) then coincides with the finite-size result computed in 
[2^. So the entropy per spin Sn/N and the specific heat Cn/N vanish as ^ 00 in the whole low-T phase. In the 
critical region T 



Tg , the finite-size scaling behavior is 



Cn{T) 
N 



1 



N{Tg - Tf 



(A16) 



As a final remark, let us mention that in the mean-field Sherrington-Kirkpatrick (SK) model of spin-glasses, the 
same expressions of Yfc (Eq. lASP also appear 2^ , but with a different interpretation : the weights are those of 
the pure states. As a consequence, the parameter /x(T), which is a complicated function of the temperature, vanishes 
at the transition uiTr) = (only one pure state in the high temperature state) and grows as T is lowered towards 
/x(r = 0) of order 0.5 [23|. This is in contrast with the REM model where fJ,{T) = T/Tg grows with the temperature 
from fi{T = 0) = (only one ground state) to fJ-(Tg) = 1 at the transition, where the number of important microscopic 
states is not finite anymore. Nevertheless, the expression (Eq. lASP for the weights of pure states means that the 
free-energy / of pure states in the SK model is distributed exponentially 



P(/) ^ e 
/-+-00 



7(T)/ 



(A17) 



with a parameter 7(T) = fi{T)/T. 
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3. Derrida-Flyvbjerg singularities 
In [l3|, the statistics of the weights Wi normahzed to 

^ = 1 (A18) 

i 

for Levy sums with < /i < 1 or equivalently of the REM or SK model have been studied in details. In particular, 
the probability distributions Pi{w) of the maximal weight w™""^ = maxi[wi], P2{w) of the second maximal weight, 
and n(y2) of the parameter Y2 = X^i '"^^ present singularities at w = 1/n and Y2 = l/n for n = 1,2.. : this shows 
that all the weight is concentrated on a few terms. 

The origin of these singularities is that the density of weights given in Eq. (|A7|) satisfy 

f{w) — Pi{w) for - < u) < 1 

f{w) =Pi{w)+P2{w) for \ <w<^ 

f(w) = Pi{w) + ... + PJw) for <w <- (A19) 

n + 1 n 



For w ^ 1, Pi{w) thus presents the same singular behavior as f{w) of Eq. (|A7p 

P^{w = l~e) oc^e^"^ (A20) 
For w 1/2, the singularity of Pi{w) comes from the two different expressions of Eas lA19l 



Pi(zi; = i + e) - Pi(«; = i - e) = /(^i; = i + e) - 



f{^ = \-^)-P2{\~e) 



cxP2(i-e) (A21) 



For 1/3 < W2 < wi, the joint probability that the two largest weights are Wi and W2 is given by Eq. IA9I for K = 2 

f{wuw2) = f2(Y3^)f(^^r'"''^2"'"^(l-"'i -^2)'^-^ (A22) 
and thus ^2(^2) reads for 1/3 < W2 < 1/2 



^^2^) = j rf^;i/K,^;2)^ ^2(i_''^)r(2^) ^2'''''y dwiw^^-^{l-wi-W2f^-^ (A23) 



The singularity near W2 1/2 is thus of order 

^2 (^W2 = ^ - oc dwiQ + e-i«i^ oce^M (A24) 

So for wi,W2 1/2, Pi(wi) and P2{w2) have a singularity of order e^^, i.e. there is no divergence, in contrast with 
the singularity near w ^ 1, but there is an infinite slope for < u < 1/2. More generally, the singularities of Pi{wi) 
at wi = l/n — e are weaker and weaker as n grows according to [17| 

Pi (^wi = ^ -e^ oc £2/^-2+" (A25) 

Similarly, for the distribution 11(12), the singularities are given by [l3| 



n(y2 = ^-e) oce"(^+5)-* (A26) 



.3 



In particular, near Y2 1, 11(12) exhibits the same divergence as Pi{w — > 1) 

n (^2 = 1 - e) cx e"-^ (A27) 
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and near Y2 1/2, the singularity 

n(^y2 = ^-e) oce^M-^ (A28) 

corresponds to an infinite slope for < /i < |. We refer the reader to [17| for more details. The shapes of Pi (it;), 
P2{w) and 11(12) can be found for = 0.3 and fj, = 0.1 on Figs 3 and 4 of [l3| respectively. 

Similar Derrida-Flyvbjerg singularities describe above for the case of Levy sums or spin-glasses (REM or SK), 
actually occur in many other contexts, such as randomly broken objects [l3,[33|j in population genetics [Sll. [3^. [SSj . 
in random walk excursions or loops [la, [H, . 
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